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I ' Abstract. We consider a stochastic model for species evolution. 

A new species is born at rate A and a species dies at rate /i. A 
random number, sampled from a given distribution F, is associated 
with each new species at the time of birth. Every time there is a 
death event, the species that is killed is the one with the smallest 
Pm ' fitness. We consider the (random) survival time of a species with 

a given fitness /. We show that the survival time distribution 
I depends crucially on whether f < fa f — fc oi f > fc where fc is 

. a critical fitness that is computed explicitly. 



f-^ . 1. Introduction 

oo 

CN ■ Consider a stochastic model for species evolution in which a new 

(N 

O ■ species is born at rate A and an existing species dies at rate /x. A ran- 

dom number, sampled from a given distribution F, is associated with 
^ ! each new species at the time of birth. We think of the random num- 

- ■ ■ ber associated with a given species as being the fitness of the species. 

These fitnesses are independent of each other and of everything else 
in the process. Every time there is a death event, the species that 
is killed is the one with the smallest fitness. We assume F to be an 
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absolute continuous distribution function. In this paper we study the 
survival time of a given species with fitness /. We show that there is a 
critical fitness /c and a sharp phase transition for the survival time of 
the species. Our analysis is based on a closed connection between our 
model and random walks. 

A similar model, meant to build phylogenetic trees, was introduced 
in Liggett and Schinazi |7j. A discrete version of this model is studied 
in Guiol et al [6J where a phase transition is shown. 



Recall we assume that F is an absolute continuous distribution func- 
tion. This implies that there exists a probability density on M such 
that 



Denote Supp(F) the support of distribution F: 

Supp(F) = {x G M : <^{x) > 0} 

Assume that originally there are k species in the process, with associate 
fitness /i < /2 < ... < = / in the support of F. Denote the r.v 
corresponding to the survival time of the species with fitness / in that 
context. 

Theorem 2.1. The survival time Tf has a Bessel distribution : 



2. Main Results 





(2.1) 
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with Xf := AF(/) < A and where Ik is the modified Bessel function of 
the first kind with indice k defined by 

Remark 2.2. Devroye p] chapter IX section 7 p. 470] includes distri- 
bution (12. ip among the class of Bessel function distributions. One can 
also find it in Feller [U chapter II section 9 Problem 15 p. 65] without 
a specific name. 

Remark 2.3. The survival time is not affected by living species 
with fitness above /. 

Remark 2.4. Whenever A/ < /i, has probability density 

/ \ fc/2 , 

Mt) = e-''-h{2^f t) for t > 0. 

In the case Xf > fi the previous function is not a density probability 
since 

f{3'-.-i«vs«*=(a'-(j)"={fe)'<. 

Though formula (12. ip gives the exact distribution of the survival 
time, it is not straightforward to come to a conclusion from it. Next 
result sheds light in the phase transition property of our model. 

Corollary 2.5. With the previous notation 
(a) If Xf < fj, then 



--yt 
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(b) If Xf > fi then 

— 7t 

P(+oo>r^^>t) ~ 

(c) If Xf = fj, then 

F{Tf > t) ~ A;(7r^t)-i/l 

Remark 2.6. Note, from Corollary 12. 5[ that if A > /i there is a phase 
transition in /. A species born with a fitness lower than 

/, := F'\^,/X) (2.3) 

dies out exponentially fast while a species with a fitness greater than fc 
has a positive probability of surviving forever. The larger A//i (recall 
that is non-decreasing) the more welcoming the environment is to 
new species. If A//i < 1 all species will die exponentially fast. On the 
other hand if A//x is large then even species with relatively low fitness 
will make it. 

Up to now we have discussed the survival of a species with a given 
fitness /. It is particularly relevant to derive some information about 
the distribution of these surviving species. Suppose that A > /i and 
let Lt and Rt be the sets of species alive at time t whose fitness is 
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respectively lower than fc and higher than fc. Since each fitness that 
has appeared up to time t will not show up again a.s., we can identify 
each species with its fitness and think of Lt and Rt as sets of points in 
(—00, fc) and {fc, 00) respectively. Next result states a straightforward 
application of the main result of Guiol et al [6] 



Proposition 2.7. Suppose that \> ji. Then 

(a) The number \Lt\ of species whose fitness is below fc is a null 
recurrent birth and death process. In particular, the set Lt is empty 
infinitely often with probability one. 

(b ) Let fc<a<b then 

1,„ , X{F{b)-F{a)) 
hm -\Rt n (a, 6)1 = ^ \^ ^— a.s. 

t^oo t A + /i 



Proof of Proposition \2. 7\ To see this observe that the embedded dis- 



crete Markov chain for our process is the stochastic model of evolution 
defined in Guiol et al |H] in such a way that p = A/(A + fi). Consider 
that whenever the total number of species is 0, the death marks (see 
page[7]below) in the construction of the process are ignored so the total 
number of species stays with probability fi/{X + fi). □ 



Remark 2.8. Observe that from Ben Ari et al. [2] it would be also 
possible to get a Central Limit Theorem and a Law of the Iterate 
logarithm for Rf. 
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3. Proofs 

3.1. Construction of the process. The construction uses ideas from 
Harris Graphical method for Markov Processes and basically take ad- 
vantage from projections properties of a bi- dimensional Poisson pro- 
cess with rate 1. In the sequel we construct a bi-variate process Zi = 
{Zl^Zf) in which Z} will represent the number of living species at 
time t, and Zf will be the set of associated living fitness: In particular 
\Z^ \ = Z^, where \A\ denotes the cardinal of set A. 

Let M be a two dimensional Poisson process with rate 1 on x M. 
For notational convenience we will identify the x-line of the plane as 
the time line. 

Suppose we start the process with k > 1 species, let /i,...,/^ be k 
independent random variables with F distribution, independent from 
M. 

Let To = and Zq = {Z^, Z^) = {k, {/i, /J) G N x S where S is 
the set of finite subsets of real numbers in [0, 1]^. 
Define 

Ti = inf{t > : M([0,t] x [0,\ + fi]) > 0} (3.1) 

that is the first time t G M"*" that a Poisson mark falls into the strip 
X [0, A + /i]. Denote by (Ti, Yi) the coordinate of the Poisson mark 
realizing the infimum in f l3.ip . Observe that from the Poisson process 
properties Yi is a uniform [0, 1] r.v. independent of Ti. 
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• Whenever Yi G [0, A] then let fk+i = F(Yi/X) (observe that 
/fe+i is also a r.v. with F law and independent of Ti) and let 

Zt, = {Zl + 1, Zl U {h+i}) = (fc + 1, {/o, /fe+i}) 

this will represent the birth of a new species; 

• else, whenever Yi e]A, A + let 

Zt, = {Zl Zl \ min{Z2}) - - 1, {/o, fk] \ min{/, : 1 < z < A;}) 

this will represent the death of the weakest species. 

For all t e [0, Ti [ denote Zt — Zq. We have thus construct the process 
Zl until time Ti (included). 

For all n > 1 denote by T„ the time of the n-th mark of the Poisson 
process M in the strip M"*" x [0, A + \j\ that is 

T; = inf{t > T„_i : M([0,t] x [0, A + > 0}. 

Suppose the process Zt is construct up to time T^, n > 1. As before 
denote by (r„+i, i^+i) the coordinate of the n + 1st Poisson mark. 

• Whenever y„+i e [0, A] then let fk+n+i — F{Yn+i/X) and define 

-^r„+i = (-^T„ + 1, -^T„ U {/fe+„+i}) 

• else 

^r„+i = (^T„ - l{zi^>o}, \ min{Z|J). 
with the convention min0 = 0; 
then for all t e [T^, let Zt = Zt„. 
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So by induction one can construct the process {Zt)t>o so that the 
second coordinate of i.e. Z| represents our fitness process starting 
with k species. 

3.2. An useful coupling. From the previous construction one can 
couple Zt — [Z], Zj) with another process e N as follows. 

Let Xq = Zq (with the construction's notation) and denote by / = 
max{/i, ...,fk}- At time Ti 

• if Yi e [0, A/]U]A, A + n] then let Xt^ = Z^^. Observe that this 
corresponds to a simultaneous death or to a simultaneous birth 
with associate fitness less than / for the Z process; 

• else (when Yi e]A/,A]) then let Xt^ = -'^o- In this case there 
is a birth on the Z process with associate fitness bigger than / 
and nothing for the X process. 

As before define Xt = Xq for all t e [0, Ti[. 
For all set A of numbers in [0, 1] denote by 



MA) = {xeA:x<f}, 



(3.2) 



i.e. the set of numbers in A less or equal to /. 



Observe that Xt = Z^ 



on [0,T,[, Xt, = \MZt,)\ < Z, 



For n > 1 suppose that Xf is construct up to time 



IfXT„^Othen \MZI)\^Xt^ 



n 
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• if Yn+i G [0, A/] (recall that fk+n+i ■= F{Yn+i/X) < f) define 

• if Yn+i G [\^\ + ji] then define 

• else let = Xr„; 

In case Xt^ = let Xt„+i = 0. 

This defines a random sequence (Tn, XT„)n>o, we define the process 

Xt = XT^ for alHG [T,,T„+i[. 
The proof of Theorem 12.11 relies on the following Lemma. 

Lemma 3.1. For any k > 1 

{rf >t} = {Xt > 0} (3.3) 

i.e. has the same law as the first passage time to of Xt the simple 
Bernoulli random walk starting at k with rate c = Xf + fi and individual 
steps equal to 1 or —1 with respective probability p = X/c and q = ii/c. 

Proof of Lemma\3Ji We have Xq = = \(f)f{Z'^)\ = k > 0. From the 
construction for all t < Tj we have minZf < f this implies > 0. 
Conversely if < t as min Z^^. > f this implies X^k = and thus 
XI = 0. □ 

Proof of Theorem \2.1[ Let (T„)„>i denotes the jump times of the pro- 
cess {Xt)t>o and set Tq = 0. The random sequence (X2'„)n>o is a simple 
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discrete time random walk on N with individual steps equal to 1 or —1 
with respective probability p and q. Denote by Hq the first hitting time 
of of this walk. A standard computation (see for instance Grimmett- 
Stirzaker |5i (15) p. 79]) gives 

whenever n + A; is even, otherwise. As T„ has a Gamma distribution 
with parameters c and n this implies that 

ft °° n 

P(Xi = 0) = / y^- —u'^-h-'^FiHo = n\Xo = k) du 

Jo u {i + m^'""^ ^ p 

and from the definition of the Bessel function ( 12. 2p 

P(Xi = 0) = / e-^"- ^ Wcu^) 
Jo \P/ 

□ 



Remark 3.2. Let rj = We have 



k 

7 



Observe that are a.s. finite stopping times and from the Strong 
Markov property (rj — Tj"^)i<j<fc is an i.i.d. sequence of r.v. with the 
distribution of r/ := rj. 
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As observed in Remark 12.41 Xf < fj, the expression (12. ip gives the 
density probabihty of r^: 



Xt) = y^e-(-+^/)*i/i(2v/M7t) 



for t > 0. Which allows to compute its Moment Generating Function: 

2fx 



M{s) = E(e" 



■TfS 



iy/(s + /i + A/)2 - AfiXf + s + fi + Xf 
This in turns allows us to compute E(rj) = j^^- So that one can see 

easily that 



E(rf ) = k 



for all A; > 0. 

Proof of Corollary \2.5\ (a). When Xf < fi (12. ip reads 

/ ,, \ r+oo 7 

P(r/ >t)= i^-^J e~^^+^f^^-h{2^fu) du 

Since (see Arfken and Weber, of [U section 11.6 eq.11.137 p. 719]) for 
all > 1 

, 4A;2 - 1^ ^ ^ ^ , 
7^(1 - ) < 4(0:) < ^= (3.4) 

for X large enough, 

1 e-(v^-v^)'" / 4A;2 - 1 



20F(/xA/)V4 «3/2 ^ 16v^ 
g-(M+A/)« 

< hi^y^f U) < 



U 



1 g-Cv^-v^/) " 
2v^(/iA/)V4 ^^^7^ 



(3.5) 
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\2 



also for X large enough. Denoting 7 = (y/JI — y A/) , observe that 

11 4P - 1 1 \ e-^" 1 e-^* 

e ^ < / -nr ^« < TTiTT- (3-6) 



Thus 



7 t3/2 272 t5/2 y - ^3/2 - ^ ^3/2 



]_ g-7* 

^ "^""^ 7tW' 
so we have a sharp asymptotic estimate for the integral of the upper 

bound in f l3.5p . 

For the integral of the lower bound, denoting a = (Ak^—l) / {16^/JIXf), 
just observe that 

to see that we also have a sharp asymptotic estimate for the integral 
of the lower bound in (13.51) . Besides, both asymptotic estimates agree. 
Plugging f l3.7p and fl3.6p into fl3.5p and then into (12. ip we finally 

concluded that for t large enough 

X ^ e-(^/^^-^A7)2^ 



□ 

Proof of Corollary \2.5\ (b). This immediate from the preceding compu- 
tations and Remark 12.41 □ 

Proof of Corollary \2.5\ (c). When Xf = ji (12. ip reads 

P(r^ >t)= e-2'^"-4(2/xM) rfw 

using in turn inequalities (13. 4 p leads directly to the result. □ 
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